Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT On Row Mean Of Spectrogram For Speaker Identification

نویسندگان

  • H. B. Kekre
  • Tanuja Sarode
  • Shachi Natu
  • Prachi Natu
  • Tanuja K. Sarode
  • Shachi J. Natu
  • Prachi J. Natu
چکیده

The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performance of this approach is compared for different number of coefficients of DCT when DCT is applied on entire spectrogram, when DCT is applied to spectrogram divided into blocks and when DCT is applied to the Row Mean of a spectrogram. Performance comparison shows that, number of mathematical computations required for DCT on Row Mean of spectrogram method is drastically less as compared to other two methods with almost equal identification rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Identification Using 2-d Dct, Walsh and Haar on Full and Block Spectrogram

This paper aims to provide different approaches to text dependent speaker identification using DCT, Walsh and Haar transform along with use of spectrograms. Spectrograms obtained from speech samples are used as image database for the study undertaken. This image database is then subjected to various transforms. Using Euclidean distance as measure of similarity, most appropriate speaker match is...

متن کامل

Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes

In this paper, we propose Speaker Identification using two transforms, namely Haar Transform and Kekre’s Transform. The speech signal spoken by a particular speaker is converted into a spectrogram by using 25% and 50% overlap between consecutive sample vectors. The two transforms are applied on the spectrogram. The row mean of the transformed matrix forms the feature vector, which is used in th...

متن کامل

Speeding Up Music Similarity

Differences to [1] A. fast spectral similarity: adaptive k-means (≤ 30 clusters), centers of the clusters are used as sample [2], likelihood of sample is computed as suggested in [3], with the modification that samples are weighted according to their priors B. Mel spectrogram (before DCT) used for fluctuation patterns C. 12 instead of 20 frequency bands used for the fluctuation patterns D. magn...

متن کامل

Accurate Fruits Fault Detection in Agricultural Goods using an Efficient Algorithm

The main purpose of this paper was to introduce an efficient algorithm for fault identification in fruits images. First, input image was de-noised using the combination of Block Matching and 3D filtering (BM3D) and Principle Component Analysis (PCA) model. Afterward, in order to reduce the size of images and increase the execution speed, refined Discrete Cosine Transform (DCT) algorithm was uti...

متن کامل

Speaker Identification using Spectrograms of Varying Frame Sizes

In this paper, a text dependent speaker recognition algorithm based on spectrogram is proposed. The spectrograms have been generated using Discrete Fourier Transform for varying frame sizes with 25% and 50% overlap between speech frames. Feature vector extraction has been done by using the row mean vector of the spectrograms. For feature matching, two distance measures, namely Euclidean distanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010